Automatic Annotation in Multirelational Information Networks

نویسنده

  • Richard Barber
چکیده

Many networks are completely encapsulated using a single node type and a single edge type. Often a more complicated model composed of multiple distinct node and edge types can be constructed to create a more informative network [2]. We call the former homogeneous networks and the latter heterogeneous. The ability to homogenize networks varies wildly, dependent on the network under analysis and the problem being solved. We present a class of networks multirelational information networks where the heterogeneous structure is necessary for performing node classi cation and detecting missing information in the network. A multirelational information network is a network G with nodes V that map to real world objects and concepts which we call entities, and with edges E which represent relations between these entities. We use nodes and edges interchangeably with entities and relations. For the multirelational networks we will consider, there exist a true vertex labeling function lv : V → 2\{}, where Σ is an alphabet of node types and an observed vertex labeling function l̂v : V → 2. Informally, entity types will exist in a hierarchy and we require that the labeling only map a vertex to a set of types T such that for any pair (ti,tj), i 6= j, ti is a descendant of tj in the hierarchy or vice versa. Similarly, there exists labeling functions l̂e and le for the edges, but we will not be exploring the existence of an edge hierarchy in this work. Our goal is to learn the true vertex labeling function from the observed multirelational information network and labels. Speci cally, given network G = (V,E), hierarchy T , and observed labeling functionl̂v, we will learn the true labeling function lv. Additionally, we predict missing data elds in nodes that have incomplete relations. Namely, the edgeset E that we observe is not the true and complete set of relations that exist in the world. For example, the Mustang entity and the Camaro entity have many features in common, but only the Camaro has the model years relation. We would like to automatically recommend relations that may exist for a node based on our estimation of the true set of relations E∗. More precisely, given G = (V,E), we would like to provide A : V− > E s.t. A takes a node v and outputs a subset of edges that it should be incident upon v. The heterogeneous structure of the network is vital to this task, as we are essentially determining the missing pieces of the heterogeneous structure. Being able to discover entity types and missing relations will provide a method for database maintainers to discover and correct missing information in their entities. We can directly present our algorithm to database providers to automatically discover incompleteness in their data. We will provide a speci c example of usefulness in 2.1.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Tags Re-ranking Using Multi-level Features in Automatic Image Annotation

Automatic image annotation is a process in which computer systems automatically assign the textual tags related with visual content to a query image. In most cases, inappropriate tags generated by the users as well as the images without any tags among the challenges available in this field have a negative effect on the query's result. In this paper, a new method is presented for automatic image...

متن کامل

Fuzzy Neighbor Voting for Automatic Image Annotation

With quick development of digital images and the availability of imaging tools, massive amounts of images are created. Therefore, efficient management and suitable retrieval, especially by computers, is one of themost challenging fields in image processing. Automatic image annotation (AIA) or refers to attaching words, keywords or comments to an image or to a selected part of it. In this paper,...

متن کامل

A CAD System Framework for the Automatic Diagnosis and Annotation of Histological and Bone Marrow Images

Due to ever increasing of medical images data in the world’s medical centers and recent developments in hardware and technology of medical imaging, necessity of medical data software analysis is needed. Equipping medical science with intelligent tools in diagnosis and treatment of illnesses has resulted in reduction of physicians’ errors and physical and financial damages. In this article we pr...

متن کامل

Automatic Colorization of Grayscale Images Using Generative Adversarial Networks

Automatic colorization of gray scale images poses a unique challenge in Information Retrieval. The goal of this field is to colorize images which have lost some color channels (such as the RGB channels or the AB channels in the LAB color space) while only having the brightness channel available, which is usually the case in a vast array of old photos and portraits. Having the ability to coloriz...

متن کامل

Improvement of generative adversarial networks for automatic text-to-image generation

This research is related to the use of deep learning tools and image processing technology in the automatic generation of images from text. Previous researches have used one sentence to produce images. In this research, a memory-based hierarchical model is presented that uses three different descriptions that are presented in the form of sentences to produce and improve the image. The proposed ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011